Data Mining and Model Simplicity : A Case Study in DiagnosisGregory
نویسندگان
چکیده
We describe the results of performing data mining on a challenging medical diagnosis domain, acute abdominal pain. This domain is well known to be diicult, yielding little more than 60% pre-dictive accuracy for most human and machine di-agnosticians. Moreover, many researchers argue that one of the simplest approaches, the naive Bayesian classiier, is optimal. By comparing the performance of the naive Bayesian classiier to its more general cousin, the Bayesian network clas-siier, and to selective Bayesian classiiers with just 10% of the total attributes, we show that the simplest models perform at least as well as the more complex models. We argue that simple models like the selective naive Bayesian classiier will perform as well as more complicated models for similarly complex domains with relatively small data sets, thereby calling into question the extra expense necessary to induce more complex models.
منابع مشابه
Approximate resistivity and susceptibility mapping from airborne electromagnetic and magnetic data, a case study for a geologically plausible porphyry copper unit in Iran
This paper describes the application of approximate methods to invert airborne magnetic data as well as helicopter-borne frequency domain electromagnetic data in order to retrieve a joint model of magnetic susceptibility and electrical resistivity. The study area located in Semnan province of Iran consists of an arc-shaped porphyry andesite covered by sedimentary units which may have potential ...
متن کاملUsing Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach
Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...
متن کاملComparison of Three Decision-Making Models in Differentiating Five Types of Heart Disease: A Case Study in Ghaem Sub-Specialty Hospital
Introduction: cardiovascular diseases are becoming the main cause of mortality and morbidity in most countries. This research goal was to predict the types of heart diseases for more accurate diagnosis by data mining and neural network technics. Method: This research was an applied-survey study and after data preprocessing, three approaches of neural network, decision making tree and Bayes simp...
متن کاملComparison of Three Decision-Making Models in Differentiating Five Types of Heart Disease: A Case Study in Ghaem Sub-Specialty Hospital
Introduction: cardiovascular diseases are becoming the main cause of mortality and morbidity in most countries. This research goal was to predict the types of heart diseases for more accurate diagnosis by data mining and neural network technics. Method: This research was an applied-survey study and after data preprocessing, three approaches of neural network, decision making tree and Bayes simp...
متن کاملGeostatistical estimation to delineate oxide and sulfide zones using geophysical data; a case study of Chahar Bakhshi vein-type gold deposit, NE Iran
Delineation of oxide and sulfide zones in mineral deposits, especially in gold deposits, is one of the most essential steps in an exploration project that has been traditionally carried out using the drilling results. Since in most mineral exploration projects there is a limited drilling dataset, application of geophysical data can reduce the error in delineation of the sulfide and oxide zones....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996